Optimization with Gradient-Boosted Trees and Risk Control
Authors
Abstract
Decision trees effectively represent the sparse, high-dimensional, and noisy nature of chemical data from experiments. Having learned a function from this data, we may then want to optimize that function, e.g., to pick the best catalyst for a chemical process. In this way, we may repurpose legacy predictive models. This work studies a large-scale, industrially relevant mixed-integer quadratic optimization problem involving: (i) pre-trained gradient-boosted regression trees modeling catalyst behavior, (ii) penalty functions mitigating risk, and (iii) penalties enforcing composition constraints. We develop heuristic methods and an exact branch-and-bound algorithm that leverage structural properties of gradient-boosted trees and penalty functions. We numerically test our methods on an industrial instance.
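As a rough illustration of the three ingredients listed in the abstract, the objective could be written along the following lines. This is only a sketch under assumed notation: the symbols $f_t$, $P_{\mathrm{risk}}$, $P_{\mathrm{comp}}$, $\lambda$, $\mu$, $x^{L}$, and $x^{U}$ do not appear in the abstract, and the choice of minimization (rather than maximization) and of simple variable bounds is likewise an assumption.

```latex
% Hypothetical notation, not taken from the abstract:
%   f_t(x)        piecewise-constant prediction of the t-th pre-trained tree
%   P_risk(x)     penalty function mitigating risk
%   P_comp(x)     penalty enforcing composition constraints
%   lambda, mu    nonnegative penalty weights
%   x^L, x^U      assumed lower and upper bounds on the variables
\min_{x} \;\; \sum_{t=1}^{T} f_t(x)
  \;+\; \lambda \, P_{\mathrm{risk}}(x)
  \;+\; \mu \, P_{\mathrm{comp}}(x)
\qquad \text{s.t.} \quad x^{L} \le x \le x^{U}
```

Under this reading, the tree sum is piecewise constant in $x$ while the penalties vary continuously, which is consistent with the abstract's description of the problem as mixed-integer quadratic when the penalties are quadratic.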
Similar resources
Gradient Boosting With Piece-Wise Linear Regression Trees
Gradient boosting using decision trees as base learners, so-called Gradient Boosted Decision Trees (GBDT), is a very successful ensemble learning algorithm widely used across a variety of applications. Recently, various GBDT construction algorithms and implementations have been designed and heavily optimized in some very popular open-source toolkits such as XGBoost and LightGBM. In this paper, ...
TF Boosted Trees: A Scalable TensorFlow Based Framework for Gradient Boosting
TF Boosted Trees (TFBT) is a new open-sourced framework for the distributed training of gradient boosted trees. It is based on TensorFlow, and its distinguishing features include a novel architecture, automatic loss differentiation, layer-by-layer boosting that results in smaller ensembles and faster prediction, principled multi-class handling, and a number of regularization techniques to preve...
Event Nugget Detection, Classification and Coreference Resolution using Deep Neural Networks and Gradient Boosted Decision Trees
For the shared task of event nugget detection at TAC 2015, we trained a deep feed-forward network achieving an official F1-score of 65.31% for plain annotations, 55.56% for event mention type and 49.16% for the realis value. For the task of Event Coreference Resolution, we prototyped a simple baseline using Gradient Boosted Decision Trees achieving an overall average CoNLL score of 70.02%. Our co...
Convexity-Concavity Indicators and Automated Trading Strategies Based on Gradient Boosted Classification Trees Models
This paper uses the visibility and invisibility algorithms to build the peak and trough indicators, providing a way to recognize the convexity, concavity and regime change of the CSI 300 Index from April 8, 2005 to June 30, 2016. The study found that the automated trading rules discovered by the gradient boosted classification tree models derived from the peak indicator outperform that fro...
Global Optimization in Learning with Important Data: an FCA-Based Approach
Nowadays, decision tree learning is one of the most popular classification and regression techniques. Though decision trees are not accurate on their own, they make very good base learners for advanced tree-based methods such as random forests and gradient boosted trees. However, applying ensembles of trees deteriorates interpretability of the final model. Another problem is that decision tree l...